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r*i ■ We show that Naming- the existence of distinct IDs known to all- is a hidden, but necessary, 

D ' assumption of Herlihy's universality result for Consensus. We then show in a very precise sense that 

Naming is harder than Consensus and bring to the surface some important differences existing between 
popular shared memory models. 

1 Introduction 

Q; 

The consensus problem enjoys a well-deserved reputation in the (theoretical) distributed computing com- 
munity. Among others, a seminal paper of Herlihy added further evidence in support of the claim that 



CN consensus is indeed a key theoretical construct [13|. Roughly speaking, Herlihy's paper considers the 

following problem: Suppose that, besides a shared memory, the hardware of our asynchronous, parallel 

machine is equipped with objects (instantiations) of certain abstract data types T\,T2, . . . ,T^; given this, 

7— I is it possible to implement objects of a new abstract data type Y in a fault-tolerant manner? The notion 
O " 



of fault-tolerance adopted here is that of wait-freedom, i.e. (n — l)-resiliency [13]. This question is the 
starting point of an interesting theory leading to many results and further intriguing questions (see fll3| , |l5| 
among others). One of the basic results of this theory, already contained in the original article of Herlihy, 
can be stated, somewhat loosely, as follows: If an abstract data type X, together with a shared memory, 
is powerful enough to implement consensus for n processes in a fault-tolerant manner then, X, together 
with a shared memory, is also powerful enough to implement in a fault-tolerant manner for n processes 
any other data structure Y. This is Herlihy's celebrated universality result for consensus. 

In this paper we perform an analysis of some of the basic assumptions underlying Herlihy's result and 
discover several interesting facts which, in view of the above, are somewhat counter-intuitive and that could 
provocatively be summarized by the slogans "consensus without naming is not universal" and "naming 
with randomization is universal." To state our results precisely we shall recall some definitions and known 
results. 

In the consensus problem we are given a set of n asynchronous processes that, as far as this paper is 
concerned, communicate via a shared-memory. Every process has its own input bit and is to produce its 
own output bit. Processes can suffer from crash failures. The problem is to devise a protocol that can 
withstand up to (n — 1) crash failures, i.e. a wait-free protocol, satisfying the following conditions: 

• Every non- faulty process terminates; 

• All output bits are the same and, 
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• The output bit is the input bit of some process. 

The naming problem on the other hand, is as follows: Devise a protocol for a set of n asynchronous 
processes such that, at the end, each non faulty process has selected a unique identifier (key). If processes 
have identifiers to start with then we have the renaming problem. 

In some sense, this paper is about the relative complexity of naming to consensus, and viceversa. 
We shall mostly concern ourselves with probabilistic protocols- every process in the system, modeled 
as an i/o automaton, has access to its own source of unbiased random bits- for systems consisting of 
asynchronous processes communicating via a shared memory. The availability of objects of abstract 
data type consensus and naming is assumed. An object of type consensus is a subroutine with input 
parameter b € {0, 1}. When invoked by a process p a bit b' is returned. This bit is the same to all invoking 
processes and is equal to some of the input bits, i.e. if b' is returned some p must have invoked the object 
with input parameter b'. An object of type naming is a subroutine without input parameters that, when 
invoked by a process p, returns a value v p £ {l,..,n}, n being the overall number of processes. For any 
two processes p ^ q we have that v p ^ v q . 

The protocols we devise should be wait-free in spite of the adversary, the "malicious" non-deterministic 
scheduling agent (algorithm) modeling the environment. The adversary decides which, among the currently 
pending operations, goes on next. Pessimistically one assumes that the adversary is actually trying to force 
the protocol to work incorrectly and that the next scheduling decision- which process moves next- can be 
based on the whole past history of the protocol execution so far. This is the so-called adaptive or strong 
adversary. In contrast, sometimes it is assumed that the adversary decides the entire execution schedule 
beforehand. This is the so-called oblivious or weak adversary. 

In the literature two shared- memory models are widespread. The first assumes multiple reader - 
multiple writer registers. In this model each location of the shared memory can be written and read by any 
process. The other model assumes multiple reader - single writer registers. Here, every register is owned 
by some unique process, which is the only process that can write on that register, while every process 
is allowed to read the contents of any register. In both models reads and writes are atomic operations; 
in case of concurrent access to the same register it is assumed that the adversary "complies with" some 
non-deterministic, but fair, policy. In this paper we shall refer to the first as the symmetric memory 
model and to the second as the asymmetric memory model. 

We are now ready to state the results of this paper. Let us start by restating Herlihy's universality 
result in our terminology. 

Theorem. [Herlihy] Suppose that n asynchronous processes interact via a shared memory and that, 

(i) Memory is symmetric; 

(ii) Each process has its own unique identifier; 

(iii) Objects of type consensus are available to the processes. 

Then, any abstract data type T can be implemented in a wait-free manner for the n processes. 
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The first question we consider in this paper is: What happens if the second hypothesis is removed? 
Can distinct identifiers be generated from scratch in this memory model? The answer is negative, even 
assuming the availability of consensus objects. 

Proposition 1 [Naming is impossible] Suppose that n asynchronous processes without identifiers interact 
via a shared memory and that, 

(i) Memory is symmetric; 

(ii) Each process has access to its own source of unbiased random-bits; 

(iii) Objects of type consensus are available to the processes; 

(iv) The adversary is weak. 

Yet, wait-free Las Vegas naming is impossible. 

This result is simple to prove, but it is interesting in several respects. First, it says that in a model more 
powerful than Herlihy's, no protocol can produce distinct identifiers with certainty. Therefore consensus 
by itself is not universal, for wait-free naming objects cannot be implemented in a wait-free manner with 
consensus alone. 

Recall that a Las Vegas protocol is always correct and that only the running time is a random variable, 
while for a Montecarlo protocol correctness too is a random variable. Note that Montecarlo naming is 
trivial- each process generates O(logn) many random bits and with probability 1 — o(l) no two of them 
will be identical. Therefore, at least at the outset, only the question of the existence of Las Vegas protocols 
is of interest. 

Proposition [l] shows that the power of randomization to "break the symmetry" is limited. If we start 
from a completely symmetric situation, it is impossible to generate identifiers that are surely distinct. 
In stark contrast with the previous result, as we prove in this paper, the following holds. 

Theorem 1 [Consensus is easy] Suppose that n asynchronous processes without identifiers interact via a 
shared memory and that, 

(i) Memory is symmetric; 

(ii) Each process has access to its own source of unbiased random-bits; 

(iii) The adversary is strong. 

Then, there exist Las Vegas, wait-free consensus protocols for n processes whose complexity is polynomial 
in expectation and with high probability. 

Notice that while Proposition p] establishes the impossibility of naming even against the weak adversary, 
here the adversary is strong. 

Incidentally, Theorem [l| shows that hypothesis (iii) of Proposition [l] is superfluos, for consensus objects 
can be simulated via software in a wait-free manner. It is well-known that hypothesis (ii) is necessary, even 
if the adversary is weak (see, for instance, |6|). 
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In some sense naming captures the notion of complete asymmetry among the processes and if we start 
from a completely symmetric situation it embodies the intuitive notion of complete break of symmetry. It 
is well-known that randomization is a powerful "symmetry-breaker" although, as Proposition [l] shows, not 
enough for naming, if we start from a perfectly symmetric situation. This leads to the question of "how 
much" asymmetry is needed for naming. Let us consider therefore asymmetric memory, assuming moreover 
that processes do not have access to the address (index) of their private registers. Formally, each process p 
accesses the m registers by means of a permutation ir p . Register Tr p - p's ith register- will always be the same 
register, but, for p ^ q, 7r p and 7r* might very well differ. In particular processes cannot obtain the physical 
address of the registers, therefore the asymmetry is somehow hidden from the processes. Although our 
motivation is mainly theoretical, this model has been used to study certain situations in large dynamically 



changing systems where a consistent indexing scheme is difficult or impossible to maintain [18|. Moreover 
this model could make sense in cryptographical systems where this kind of consistency is to be avoided. 

We show the following. If the memory is initialized to all O's (or any other fixed value) we say that it 
is initialized fairly. 

Proposition 2 Assume that the memory is initialized fairly. If processes are identical, deterministic i/o 
automata without identifiers then naming is impossible, even if memory is asymmetric and the adversary 
is weak. 

Thus, by themselves, neither randomization nor asymmetric memory can break the symmetry. What is 
needed is their acting together. 

Theorem 2 Suppose that n asynchronous processes without identifiers interact via a shared memory and 
that, 

(i) Memory is asymmetric and initialized fairly; 

(ii) Each process has access to its own source of unbiased random-bits; 
(hi) The adversary is strong. 

Then, there exist a Las Vegas, wait-free naming protocol for n processes whose running time is polynomial 
in expectation. Furthermore the key space from which identifiers are drawn has size n, which is optimal. 

Therefore, with randomization, asymmetric memory is inherently symmetry-breaking, whereas consensus 
is not. 



This result improves on previous work in [21] in which naming protocols with almost-optimal key range 
are given. We prove two versions of the above result. We first give a simple protocol whose running time 
is 0(n 2 logn) w.h.p. and a faster protocol, named squeeze, whose expected running time is 0(ralog 3 n). 
As a by-product we also show that an object we call selectWinner cannot be implemented by consensus 
alone, i.e. without naming, even if randomization is available and the adversary is weak. The semantics of 
selectWinner is the following: the object selects a unique winner among the invoking processes. 

Since any deterministic protocol must use a key range of size at least 2n — 1 in order to be wait-free 



[14], this is yet another instance in which randomization is more powerful than determinism as far as 



fault-tolerant computing is concerned. 
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Our results show, perhaps surprisingly, that multiple reader - single writer registers are more powerful 
than multiple reader - multiple writer registers, even though the latter might represent a faster alternative. 
This highlights an important difference between the two models. 

Our Theorem |l| is obtained by combining several known ideas and protocols, in particular those in p| 
and [fTcR . When compared to the protocol in Q it is, we believe, simpler, and its correctness is easier to 
establish (see, for instance, f22| ). Moreover, it works in the less powerful symmetric model and can deal 



with the strong adversary, whereas the protocol in [1C] can only withstand the "intermediate" adversary, 



whose power lies somewhere between the more traditional weak and strong adversaries we consider. From 
the technical point of view, our Propositions p] and || are essentially contained in [17] to which we refer for 
other interesting related results. Other related work can be found in @, O]. 

In spite of the fact that we make use of several known technical ingredients, our analysis, we believe, is 
novel and brings to light for the first time new and, we hope, interesting aspects of fundamental concepts. 

2 Consensus is Easy, Naming is Hard 

We start by outlining a consensus protocol assuming that (a) the memory is symmetric, (b) processes are 
i/o automata without identifiers which have access to their own source of (c) random bits. Our protocol 
is obtained by combining together several known ideas and by adapting them to our setting. The protocol, 
a randomized implementation of n-process binary consensus for symmetric memory, is a modification of 



the protocol proposed by Chandra [10|. The original protocol cannot be used in our setting since its 
shared coins require that processes have unique IDs. Thus, we combine it with a modification of the 
weak shared coin protocol of Aspnes and Herlihy ||]. The latter cannot be directly used in our setting 
either, since it requires asymmetric memory. Another difference is that, unlike in Chandra's protocol, we 
cannot revert to Aspnes' consensus [|J. In this paper we are only interested in establishing the existence 
of a polynomial protocol and make no attempt at optimization. Since the expected running time of our 
protocol is polynomial, by Markov's Inequality, it follows that the running time and, consequently, the 
space used are polynomial with high probability (inverse polynomial probability of failure). Conceivably 
super polynomial space could be needed. We leave it as an open problem whether this is necessary. In the 
sequel we will assume familiarity with the notion of weak shared coin of Q to which the reader is referred. 

The protocol, shown in Figure fj], is based on the following idea. Processes engage in a race of sorts by 
splitting into two groups: those supporting the value and those supporting the 1 value. At the beginning 
membership in the two "teams" is decided by the input bits. Corresponding to each team there is a 
"counter", implemented with a row of contiguous "flags"- the array of booleans Mark[, ]- which are to 
be raised one after the other from left to right by the team members, cooperatively and asynchronously. 
The variable position p of each process p records the rightmost (raised) flag of its team the process knows 
about. The protocol keeps executing the following loop, until a decision is made. The current team of 
process p is defined by the variable myTeam p . The process first increments its own team counter by raising 
the positiorip-th flag of its own team (this might have already been done by some other team member, 
but never mind). For instance, if p's team corresponds to the value b then, Mark [posiiiore p , b] is set to 
true. Thus, as far as process p is concerned, the value of its own team counter is position p (of course, this 
might not accurately reflect the real situation). The process then "reads" the other counter by looking at 
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the other team's row of flags at positions position p + l,position p ,position p — 1, in this order. There are 
four cases to consider: (a) if the other team is ahead the process sets the variable tentativeNewTeam p to 
the other team; (b) if the two counters are equal, the process flips a fair coin X € {0, 1} by invoking the 
protocol GetCoin5(, ) and sets tentativeNewTeamp to X; (c) if the other team trails by one, the process 
sticks to its team, and (d) if the other team trails by two (or more) the process decides on its own team 
and stops executing the protocol. The setting of tentativeNewTeamp is, as the name suggests, tentative. 
Before executing the next iteration, the process checks again the counter of its own team. If this has been 
changed in the meanwhile (i.e. if the (position p + l)-st flag has been raised) then the process sticks to 
his old team and continues; otherwise, it does join the team specified by tentativeNewTeamp. The array 
Mark[j, s] implemented with multiple reader - multiple writer registers, while the other variables 
are local to each process and accessible to it only. The local variables can assume only a finite (constant) 
set of values and can therefore be "hardwired" in the states of the i/o automaton representing the process. 

The only, but crucial, difference between our protocol and that of Chandra concerns procedure GetCoin^(, 
In Chandra's setting essentially it is possible to implement "via software" a global coin, thanks to the nam- 
ing assumption and the special assumption concerning the power of the adversary ( "intermediate" instead 
of strong). In the implementation in Figure [l], we use a protocol for a weak shared coin for symmetric 
memory. For every b € {0, 1} and every i > 1 an independent realization of the weak shared coin protocol 
is performed. An invocation of such a protocol is denoted by GetCoin,5(6, i), where 6 is a positive real 
that represents the agreement parameter of the weak shared coin (see H). GetCoin^, i) satisfies th 
efollowing conditions. Upon invocations with values b and i, it returns to all invoking processes with 
probability p > (1 — 6)/2; it returns 1 to all invoking processes with probability p > (1 — 5)/2; and, it 
returns to some and 1 to the others with probaility at most 5 f||. 

First, we prove that the protocol in Figure [l] is correct and efficient. Later we show how to implement 
the weak shared coin. 

Lemma 1 If some process decides v at timet, then, before timet some process started executing propose (v). 



Proof The proof is exactly the same of that of Lemma 1 in [1C]. V 



Lemma 2 No two processes decide different values. 

Proof The proof is exactly the same of that of case (3) of Lemma 4 in pi . V 



Lemma 3 Suppose that the following conditions hold: 

i) Mark[6 ; i] = true at time t, 

ii) Mark[1 — b, i] = false before time t, 

Hi) Mark[1 — b, i] is set true at time t' (t 1 > t), and 

iv) every invocation of both GetCoin,^, i) and GetCoin^I — b, i) yields value b. 
Then, no process sets Mark[1 — b,i + 1] to true. 
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{Initialization} 

Mark[0, 0], MARK[1, 0] <- true 

{Algorithm for process p} 

function propose(v): returns or 1 



1. myTeam p <— v; otherTeam p <— 1 - myTeam p 

2. positiorip «— 1 

3. repeat 

4. Mark [myTeam p , position p ] «— true 

5. if Mark [otherTeam p , position p + 1] 

6. tentativeN ewT eam p <— 1 — myTeam p 

7. else if Mark [otherTeam p , position p ] 

8. tentative NewTearrip <— GetCoin^ (myTeam p , position p ) 

9. else if Mark [otherTeam p , position p — 1] 

10. tentativeN ewTeam p <— myTeam p 

11. else return(myTeam p ) {Decide myTeam p } 

12. if not Mark [myTeam pi position p + 1] 

13. myTeam p *— tentativeN ewTeam p 

14. position p <— position p + 1 
end repeat 



Figure 1: n-process binary consensus for symmetric memory 



Proof The proof is essentially the same of that of the Claim included in the proof of Lemma 6 in [10]. V 

The next lemma is the heart of the new proof. The difficulty of course is that now we are using protocol 
GetCoin^, ) instead of the "global coins" of JlO|], and have to contend with the strong adversary. The 
crucial observation is that if two teams are in the same position i and the adversary wants to preserve 
parity between them, it must allow both teams to raise their flags "simultaneously," i.e. at least one 
teammate in each team must observe parity in the row of flags. But then each team will proceed to invoke 
GetCoin^(, ), whose unknown outcome is unfavorable to the adversary with probability at least (5/2) 2 . 

Lemma 4 //Mark[6, i] = true at time t and Mark[1 — b, i] = false before time t, then with probability 
at least S 2 /4, Mark[1 — b,i + l] is always false. 

Proof If Mark[1 — b, i] is always false, then it can be shown that Mark[1 — b, % + 1] is always false 
(the proof is the same of that of Lemma 2 in |fL0|l ). So, assume that Mark[1 — b, i] is set to true at some 
time t! (clearly, t' > t). Since no invocation of both GetCoiN5(6, i) and GetCoin^I — b, i) is made 
before time t, the values yielded by these invocations are independent of the schedule until time t. Thus, 
with probability at least 5 2 /4, all the invocations of GetCoin^, i) and GetCoin,s(1 — b, i) yield the 
same value b. From Lemma [|, it follows that, with probability at least <5 2 /4, Mark[1 — b, i + 1] is always 
false. V 

Theorem 3 The protocol of Figure \l\ is a randomized solution to n-process binary consensus. Assuming 
that each invocation of GetCows(, ) costs one unit of time, the expected running time per process 0(1). 
Furthermore, with high probability every process will invoke GetCoin^, ) O(logn) many times. 
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Proof From Lemma [2], if any two processes decide, they decide on the same value. From Lemma || we 
know that the decision value is the input bit of some process. We now show that all processes decide within 
a finite number of steps and that this number is polynomial both in expectation and with high probability. 

As regarding the expected decision time for any process, let P{i) denote the probability that there is 
a value b € {0, 1} such that Mark[6, i] is always false. From Lemma [|, it follows that 

P{i) > 1 - (1 - 5 2 /4) i ~ 1 i > 1 

Also, if Mark [b, i] is always false, it is easy to see that all the processes decide within i + 1 iterations of 
the repeat loop. Thus, with probability at least 1 — (1 — (5 2 /4)* _1 , all the processes decide within i + 1 
iterations of the repeat loop. This implies that the expected running time per process is 0(1). The high 
probability claim follows from the observation that pessimistically the process describing the invocations 
of GetCoin^, ) can be modeled as a geometric distribution with parameter p := (8/2) 2 . y 

We now come to the implementation of the weak shared coin for symmetric memory, which we accom- 
plish via a slight modification of the protocol of Aspnes and Herlihy ||] . In that protocol the n processes 
cooperatively simulate a random walk with absorbing barriers. To keep track of the pebble a distributed 
counter is employed. The distributed counter is implemented with an array of n registers, with position i 
privately owned by process i (that is, naming or asymmetric memory is assumed). When process i wants 
to move the pebble it updates atomically its own private register by incrementing or decrementing it by 
one. The private register also records another piece of information namely, the number of times that the 
owner updated it (this allows one to show that the implementation of the read is linearizable). On the 
other hand, reading the position of the pebble is a non-atomic operation. To read the counter the process 
scans the array of registers twice; if the two scans yield identical values the read is completed, otherwise 
two more scans are performed, and so on. As shown in ||, the expected number of elementary operations 
(read's and write's) performed by each process is 0(n 4 ). 

Since in our setting we cannot use single- writer registers, we use an array C[] of n 2 multiple- writer 
multiple-reader registers for the counter. The algorithm for a process p is as follows. Firstly, p chooses 
uniformly at random one of the n 2 registers of C[], let it be the kth. Then, the process proceeds with the 
protocol of Aspnes and Herlihy by using C[A;] as its own register and by applying the counting operations 
to all the registers of C[]. Since we are using n 2 registers instead of n, the expected number of steps that 
each process performs to simulate the protocol is 0(n 5 ). The agreement parameter of the protocol is set to 
2eS. Since the expected number of rounds of the original protocol is 0(n 4 ), by Markov's Inequality, there 
is a constant B such that, with probability at least 1/2, the protocol terminates within Bn 5 rounds. It is 
easy to see that if no two processes choose the same register, then the protocol implements a weak shared 
coin with the same agreement parameter of the original protocol in 0(n 5 ) many steps. To ensure that our 
protocol will terminate in any case, if after Bn 5 steps the process has not yet decided then it flips a coin 
and decides accordingly. Thus, in any case the protocol terminates returning a value or 1 to the calling 
process within 0(n 5 ) steps. The probability that no two processes choose the same register is 

( n 2 ) ( n 2 ) ( n? ) ~~ e 
Thus, the agreement parameter of our protocol is at least 1/2 • 1/e • 2e5 = 5. We have proved the following 
fact. 



8 



Lemma 5 For any 5 > 0, a weak shared coin with agreement parameter 5 can be implemented in the 
symmetric model (with randomization) in 0(n 5 ) steps, even against the strong adversary. 



We show next that, in contrast, no protocol exists in the symmetric model for naming, even assuming 
the availability of consensus objects and the weak adversary. 

Theorem 4 Suppose that an asynchronous, shared memory machine is such that: 

• the memory is symmetric; 

• every process has access to a source of independent, unbiased random bits, and 

• consensus objects are available. 

Then, still, naming is impossible even against a weak adversary. 

Proof By contradiction suppose there exist such a protocol. Consider two processes P and Q and let only 
Q go. Since the protocol is wait-free there exists a sequence of steps a = s\S2 ■ ■ ■ s n taken by Q such that Q 
decides on a name k a . The memory goes through a sequence of states m^mi . . . m n . The sequence a has a 
certain probability p a = P1P2 ■ ■ -p n of being executed by Q. Start the system again, this time making both 
P and Q move, but one step at a time alternating between P and Q. With probability p\ both P and Q will 
make the same step s\. A simple case analysis performed on the atomic operations (read, write, invoke 
consensus) shows that thereafter P and Q are in the same state and the shared memory is in the same 
state mi in which it was when Q executed s\ alone. This happens with probability p\ . With probability 
p\, if P and Q make one more step each, we reach a situation in which P and Q are in the same state and 
the memory state is m2- And so on, until, with probability p 2 a both P and Q decide on the same identifier, 
a contradiction. V 

Thus, naming is a necessary assumption in Herlihy's universality construction. 

3 Naming with Asymmetric Memory 

We now come to the question of whether asymmetric memory can be used to break the symmetry. First 
we show that by itself it is not sufficient. Then we show that together with randomness it allows naming 
to be solved in polynomial-time, using a key space of optimal size. 

Proposition 3 Suppose the memory is initialized fairly that is, all registers are initially set to (or any 
other fixed value). Then, if processes are identical, deterministic i/o automata without identifiers, naming 
is impossible, even if memory is asymmetric and the adversary is weak. 

Proof Consider two processes p and q that are identical deterministic i/o automata without identifiers. 
The shared memory is asymmetric: processes p and q access the n registers by means of permutations ir p 
and 7T q . That is, when processor p (q) performs a Read(i) operation, the result will be the content of the 



Corollary 1 The expected running time per process 
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register having absolute index ir p (i) (7r q (i)). Analogously for the Write (i, v) operations. We assume that 
Write (i , v) is legal only if i < n/2. That is, the local indices of the private registers are 1,2,... , n/2. We 
show the impossibility of naming even against the very simple adversary with the alternating execution 
schedule: p, q, p, q, . . . In this schedule the execution proceeds in rounds. Each round consists of a step of 
p followed by a step of q. 

We need some notions. We call the map of the contents of all the registers, shared and local, the absolute 
view. It is a function that maps each absolute index i to the content of the register of absolute index i. 
Given an absolute view V remains determined the local views L p (V) and L q (V). The local view L p (V) is 
the map of the contents of the local registers of p and of the shared registers through the permutation ir p . 
In other words, L p (V) is a function that maps each local index j to the content of the register having local 
index j w.r.t. the processor p. In particular, if j is a local index of a shared register then the local view 
maps j to the content of the register of absolute index vr p (j). Analogously for the local view L q (V). We 
have already assumed that the set of local indices of shared registers is the same for both processors (i.e. 
the set {1, 2, . . . , n}). We further assume that the set of local indices of local registers is the same for both 
processors. Thus, the entire set of local indices is the same for both processors. Therefore, the domains of 
the two local views coincide. An instant configuration, or simply a configuration, for processor p (q) is a 
pair (L, s) where L is a local view and s is a state of p (q). 

Let and si be the states of p and q, respectively, at the beginning of round t. Let Vt be the 
absolute view at the beginning of round t. We show that, for any t, if the configurations (L p (Vt),s^) and 
(L q (Vt), si) are equal then after the execution of round t the resulting configurations (L p (Vt + i), and 
(L g (Vt+i), s1 +1 ) are equal again. This fact together with the assumption that the initial configurations 
(Lp(Vb), fig) an d (LqiVo), Sq) are equal imply that the processes cannot select unique identifiers. 

Suppose that (L p (Vt),sf) and (L q (Vt), si) are equal. Since sf = si and the processes are identical 
deterministic i/o automata, both processes execute the same operation op at round t. The operation op 
can be a Read, a Write, or a local operation. 

If op is a Read(i) then, since L p {V t ) and L q {V t ) are equal, the processors read from different registers 
that have equal contents. So, the changes to the local registers are the same and the local views L p {Vt+i) 
and L q {y t+ i) are equal. Consequently, also the states and s1 +1 coincide. 

If op is a Write (i, x) then, both processors write the same value to different registers. But these 
registers have the same local index. So, the next local views L p (Vt+i) and L q (Vt+i) are equal again. 
Consequently, also the states s\ +l and s1 +1 coincide. 

If op is a local operation then, since op and the local memory is the same for both processors, the changes 
to the local memories are the same and the local views L p (T4 + i) and L q (Vt+i) are equal. Consequently, 
also the states sf +1 and s^ +1 coincide. V 

Let us now turn to our naming protocol squeeze. For now, let us assume the availability of objects 
selectWinner (i) with the following semantics. The object is invoked with a parameter i; the response is 
to return the value "You own key i!" to exactly one of the invoking processes, and "Sorry, look for another 
key" to all remaining processes. The choice of the "winner" is non-deterministic. Later we will show that 
selectWinner admits a wait-free, polynomial time Las Vegas solution in our setting. With selectWinner 
a naming protocol can be easily obtained as follows: Try each key one by one, in sequence, each time 
invoking selectWinner. This protocol, dubbed simpleButExpensive, is shown in Figure |2| Therefore we 
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protocol simpleButExpensiveO : key; 

begin 

for k := 1 to n do 

if selectWinner (k) = ''You own key k!'' then return (k) ; 

end 



Figure 2: Simple but expensive protocol for naming 

obtain the following. 

Proposition 4 Suppose that n asynchronous processes without identifiers interact via a shared memory 
and that, 

(i) Memory is asymmetric; 

(ii) Each process has access to its own source of unbiased random-bits; 

(iii) The adversary is strong. 

Then, protocol simpleButExpensive is a wait-free Las Vegas solution to naming whose running time is 
polynomial in expectation and with high probability 

The high probability statement follows from Lemma ^. 

Although the overall running time of simpleButExpensive is polynomial, given the high cost of invok- 
ing selectWinner we turn our attention to protocol squeeze which, in expecatation, will only perform 
0(log 2 n) such invocations instead of linearly many. 

In protocol squeeze the name space is divided into segments, defined by the following recurrence, where 
p is a parameter between and 1 to be fixed later: 

s k =p(l -p) k ~ l n 

To simplify the presentation we assume without loss of generality that all Sj's are integral, si is the last 
value Si such that s, > log 2 n. The first segment consists of the key interval I\ := [0, s\); the second segment 
consists of the key interval I2 ■= [s±, s\ + S2); the third of the key interval I3 := [si + S2, s\ + S2 + S3), and 
so on. The final segment consists of the last n — Y^j=x s j keys. In the protocol, each process p starts 
by selecting a tentative key i uniformly at random in I\. Then, it invokes selectWinner (i); if p "wins," 
the key becomes final and p stops; otherwise, p selects a second tentative key j uniformly at random in 
12- Again, selectWinner(j) is invoked and if p "wins" j becomes final and p stops, otherwise p continues 
in this fashion until is reached. The keys of Ig+i are tried one by one in sequence. If at the end p 
has no key yet, it will execute the protocol simpleButExpensive of Figure || as a back-up procedure. The 
resulting protocol appears in Figure ||[ 

Assuming the availability of objects of type selectWinner, protocol squeeze assigns a key to every 
non-faulty process with probability 1. This follows, because the protocol ensures that selectWinner (i) 
is invoked for every i, 1 < i < n, and each such invocation assigns a key to exactly one process. We will 
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now argue that with high probability every process receives a unique key before the back-up procedure, 
and that therefore the number of invocations of selectWinner objects is 0(log 2 n) per process w.h.p.. 

Protocol squeeze maintains the following invariant (w.h.p.). Let P& be the set of processes that after 
k — 1 attempts still have to grab a key. Their k-th. attempt will be to select a key at random in segment Ik- 
Then, |P^| « |ifc|logn 3> \Ik\ (hence the protocol "squeezes" P& into I/,)- Once the numbers are plugged 
in it follows that, with high probability, every key in Ik will be claimed by some process, and this for all 
k. Since every key is claimed w.h.p. before the back-up procedure, every process, w.h.p., receives a key 
within 0(log 2 n) invocations of selectWinner. By setting the parameter p appropriately it is possible to 
keep the number of segments small, i.e. 0(log 2 n), while maintaining the invariant \Pk\ 3> \Ik\ for each 
segment. 

Let us focus first on a run without crashes. Let pi be defined by the following recurrence 

Pi ■= {i-py~ l n. 

Clearly, |Pj| > pi. We want to show that, with high probability, |Pj| = pi, for i < I. If we can show this 
we are done because then p^ = and the protocol ensures that every one of the remaining pn process will 
receive one of the last si keys. A key k is claimed if selectWinner (k) is invoked by some process. Then, 
since there are no crashes, 

Pr[3fc € Ii, k not claimed] < ^1 — < exp j — — j < exp j — - j = 

for any fixed c > 0, provided that 

1 

p := — . 

clogn 

With this choce of p the number of segments £ is 0(log 2 n). The expected running time is therefore 

E[T{n)\ = 0(log 2 n)(l - £/n c ) + 0(n)£/n c = 0(log 2 n) 

for c> 3. 

We now argue that with crashes the situation can only improve. Let C\ be the set of processes that 
crash before obtaining a response to their invocation of selectWinner in l\. Let F\ (F as in free) be the set 
of keys of 1\ that are not claimed after processes in Pi have made their random choice. If Pi is non empty, 
assign processes of C\ to keys of Pi in a one-to-one fashion. Let f\ be this map. Then, the probability that 
a key is claimed before any process under this new scheme is no lower than in a run without crashes. We 
then set C2 to be the set of processes of P2 that crashed before obtaining a response for their invocation 
of selectWinner in I2, union C\ — f\(Ci). Again, after processes in P2 randomly select keys in I2, assign 
processes of C2 to keys of P2 by means of a one-to-one function f2- Thus, again, the probability that a key 
in I2 is claimed is higher than in a run without crashes. And so on. Thus, we have the following. 

Lemma 6 The expected number of invocations o/selectWinner per process in protocol squeeze is 0(log 2 n). 

It remains to show how to implement selectWinner in a wait-free manner in polynomial time. We will 
assume the availability of objects of type consensus (i ,b) where 1 < i < n and b G {0, 1}. Each invoking 
process p will perform the invocation using the two parameters i and 6; the object response will be the 
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same to all processes and will be "The consensus value for i is v" where v is one of the bits b that were 
proposed. By Theorem || the availability of consensus objects can be assumed without loss of generality. 
Assuming them will simplify the presentation. The protocol for selectWinner, shown in Figure ||, is as 
follows. Each process p generates a bit b\ at random and invokes consensus (1,5^). Let v\ be the response 
of the consensus object. If b\ 7^ v\ then p is a loser and exits the protocol. Otherwise, p is still in the 
game. Now the problem is to ascertain whether p is alone, in that case it is the winner, or if there are other 
processes still in the game. To this end, each remaining process scans the array W[l, i], for 1 < % < n, that 
is initialized to all O's. If W[l,z] contains a 1 then p declares itself a loser and exits; otherwise it writes 
a 1 in its private position W[l,p] and scans W[l, — ] again. If W[l, — ] contains a single 1, namely W[l,p] 
then p declares itself the winner and grabs the key, otherwise it continues the game that is, it generates 
a second bit b\ at random, invokes consensus (2,6^), and so on. The following observations and lemma 
establish the correctness of the protocol. 

Observation 1 If p declares itself the winner then it is the only process to do so. 
Observation 2 There is always a process that declares itself the winner. 

Lemma 7 With probability 1 — o(l), every process p generates O(logn) many random bits b\ and the 
number of bit operations per process is O(nlogn). 

Proof We refer to an iteration of a repeat loop of protocol selectWinner as a round, i.e. the round % 
refers to the set of iterations of the repeat loop in which the participating processes toss their private coin 
for the ith time. We assume pessimistically that the consensus object of protocol selectWinner is under 
the control of the strong adversary, subject to the following rules. Denoting with 1, . . . ,k the processes that 
perform the ith. coin toss, If < b\ = b\ = . . . = bf = or < b\ = 6| = . . . = 6* = 1 then the adversary can 
respond with consensus value or 1, respectively. Otherwise the adversary can respond with any value. 

The goal of the adversary is to maximize the number of rounds. Therefore its best policy is to return 
the consensus value that eliminate the smallest number of processes, i.e. the best strategy is to return 
the majority bit. The probability that the ith outcome of process p is the majority value depends on the 
number of processes, but it is easily seen to be maximized when there are two processes. The probability 
that the minority value is the outcome of at least 1/4 of the processes depends on the number of processes, 
but it is monotonically decreasing. Therefore the smallest value is 1/4, when just 2 processes are involved. 

We call a run successful if the minority value is the outcome of at least 1/4 of the processes. Then, 
log 4 / 3 n many successful rounds suffice to select a winner. A straightforward application of the Chernoff- 
Hoeffding bounds show that with probability 1 — o(l) at least log 4 / 3 n rounds out of 81og 4/ / 3 n many will 
be successful. 

Since every iteration of selectWinner costs 0(n) steps, the claim follows. V 

Theorem 5 Protocol squeeze is a Las Vegas, wait-free naiming protocol for asymmetric memory whose 
running time is 0(n 2 logn) with probability 1 — o(l). 
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protocol squeezeO: key; 
begin 

for i := 1 to £ do begin 

k := random key in interval ; 

if selectWinner (k) = ''You own key k!'' then return (k) ; 
end; 

for k := n - S£ to n do { tr y key in Ii one by one} 

if selectWinner (k) = ''You own key k!'' then return (k) ; 
return(simpleButExpensive() ) {back up procedure} 

end 



Figure 3: Protocol squeeze 



protocol selectWinner (i : key): outcome; 

myReg := ' 'private register of executing process''; 

attempt := 1; 

repeat 

b := random bit; 

if (b = consensus (i, b)) then begin 
scan W[attempt,j] for l<j<n; 
if (W [attempt , j] = 0, for all j) then begin 
W [attempt , myReg] := 1; 
scan W [attempt, j] for l<j<n; 

if (W[attempt, j] = 0, for all j <> myReg) then return(i); {key is grabbed!} 
else attempt := attempt + 1; {keep trying} 

else return (' 'Sorry, look for another key. ' ') ; 
end repeat 



Figure 4: Protocol selectWinner 

Remark 1: Protocol squeeze is also a good renaming protocol. Instead of the random bits, each process 
can use the bits of its own IDs starting, say, from the left hand side. Since the ID's are all different the 
above scheme will always select a unique winner within 0(\ID\) invocation of consensus. 

Remark 2: The only part of protocol squeeze that actually uses the memory is protocol selectWinner. 
In view of Proposition || this task must be impossible with symmetric memory, even if randomness and 
consensus are available. Thus, this is another task for which, strictly speaking, Herlihy's result does not 
hold and it is another example of something that cannot be accomplished by the power of randomization 
alone. 
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